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Abstract 

Background: The risk of influenza infection depends on biological characteristics, individual or collective behaviors 
and the environmental context. The Cohorts for Pandemic Influenza (CoPanFlu) France study was set up in 2009 
after the identification of the novel swine-origin A/H1N1 pandemic influenza virus. This cohort of 601 households 
(1450 subjects) representative for the general population aims at using an integrative approach to study the risk 
and characteristics of influenza infection as a complex combination of data collected from questionnaires regarding 
sociodemographic, medical, behavioral characteristics of subjects and indoor environment, using biological samples 
or environmental databases. 

Methods/Design: Households were included between December 2009 and July 2010. The design of this study 
relies on systematic follow-up visits between influenza seasons and additional visits during influenza seasons, when 
an influenza-like illness is detected in a household via an active surveillance system. During systematic visits, a nurse 
collects individual and environmental data on questionnaires and obtains blood samples from all members of the 
household. When an influenza-like-illness is detected, a nurse visits the household three times during the 12 
following days, and collects data on questionnaires regarding exposure and symptoms, and biological samples 
(including nasal swabs) from all subjects in the household. The end of the follow-up period is expected in fall 2012. 

Discussion: The large amount of data collected throughout the follow-up will permit a multidisciplinary study of 
influenza infections. Additional data is being collected and analyzed in this ongoing cohort. The longitudinal 
analysis of these households will permit integrative analyses of complex phenomena such as individual, collective 
and environmental risk factors of infection, routes of transmission, or determinants of the immune response to 
infection or vaccination. 
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Background 

The first human cases of influenza caused by a novel 
swine-origin A/H1N1 pandemic influenza virus variant 
(HINlpdm) were reported in Mexico and the United 
States in April 2009 [1]. Given the rapid spread of this 
virus and considering the likelihood of its pandemic ex- 
tent - confirmed by the World Health Organization 
(WHO) on June 11, 2009 [2] - the Cohorts for Pan- 
demic Influenza (CoPanFlu) international consortium 
was initiated to study individual and collective determi- 
nants of HINlpdm influenza infection across countries 
by setting up prospective cohorts of households, fol- 
lowed for 2 years in 6 countries or regions of the world: 
metropolitan France, Mali [3], Bolivia, Laos, Reunion Is- 
land [4] and Djibouti. The CoPanFlu-France cohort, set 
up in metropolitan general population, is part of the 
CoPanFlu international consortium and its protocol 
served as a blueprint for the other international cohorts. 

Several studies already reported risk factors for sea- 
sonal or pandemic influenza infection in households. 
These studies focused on including individual character- 
istics of index patients and their household contacts [5- 
10] or hygiene measures as predictors of secondary 
household infections [11,12]. In addition to household 
studies, risk factors of seasonal influenza infections have 
been studied in relation to characteristics of social con- 
tacts [13] or baseline serological status of the host [14]. 
However, to our knowledge, no attempt was made to 
study the risk of influenza infection as a complex com- 
bination of biological characteristics (including immun- 
ity), individual or collective behaviors and environmental 
context. This integrative approach, in which epidemio- 
logical data is comprehensively collected and analyzed, is 
currently developed for non-communicable diseases and 
relies on methods derived from Genome- Wide Associ- 
ation Studies (GWAS) [15,16]. To achieve our objec- 
tives, we developed a multidisciplinary approach, with 
an original design involving data collection on subjects 
and their environment and biological samples. 

Methods/Design 

Sampling 

This cohort was designed to assess the relative risk of in- 
fection by the HINlpdm virus. We first intended to in- 
clude 1000 households (about 2100 subjects) which 
would have permitted to detect covariates associated to 
a relative risk > 1.4 with a 80% power and 5% signifi- 
cance, assuming a cumulative incidence of 10% and 
intra-household correlation of 0.3. 

Households were sampled using a random tele- 
phonic procedure (Mitofsky-Waksberg design [17]) in 
a stratified geographical sampling scheme, aimed at in- 
cluding a sample of subjects as close as possible to the 
French general population [18,19]. Forty addresses 



were drawn from the national directory. These 
addresses defined the centers of 40 areas inside which 
subjects were eligible. The limits of these areas were 
defined as the smallest circle including 130,000 house- 
hold addresses in the public directory. The size of 
these areas varied (5 to 5000 km 2 ) according to popula- 
tion density (see Figure 1). In each area, two lists of 
households were drawn: 

• A "landline" list of 25 households: these households 
were chosen as those with the phone number 
immediately following a landline number drawn in 
this area. Since landline numbers are geographically 
allocated, this method ensured reaching households 
who chose not to be listed in the national directory. 

• A "mobile phone" list of 7 households: these 
households were directly drawn in the national 
directory, in order to reach households without a 
fixed phone. 

Addresses were iteratively drawn from the 40 lists of 
130,000 households each and these households were 
phoned to present the study and, upon meeting eligibil- 
ity criteria, were sent a written description of the study. 
A household was considered as "pre-included" when a 
referent member sent back a filled form to confirm his 
agreement. According to this method, 1,280 households 
were pre-included, i.e. agreed to be visited by a nurse for 
an inclusion visit involving all household members. We 
anticipated that 20% of pre-included households would 
finally decline to participate in the cohort. 

Eligibility criteria 

A household was defined as a person or group of people 
occupying the same domicile. All households were eli- 
gible to participate in the cohort, provided at least one 
member was over 18 years of age and French-speaking. 
A household member was defined as a person living at 
least half his/her time in the household. All household 
members were eligible, regardless of age. The inclusion 
of a household required the participation of all mem- 
bers: the refusal of one or more member(s) prevented 
the inclusion of other members. 

Participants 

Five thousand one hundred and two households were 
contacted by phone in order to achieve our targeted 
number of 1280 pre-included households (see Figure 2). 
The rate of contacted eligible households who agreed to 
be pre-included varied from 17% to 34% across the 40 
areas. The main reasons for non-participation were lack 
of time and expected difficulty to collect blood samples 
from children. 
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Figure 1 Distribution of included households in relation to density of population (total: N = 601). White discs represent the 40 areas of 
the study. Overlapping areas are merged. 



Six hundred and seven households were visited by a 
nurse for an inclusion visit, among which six finally did 
not agree to participate (refusal of at least one member 
after receiving more detailed information on the study). 
Data was collected on the 601 remaining households 
(1450 subjects). According to population census data 
[20], these households had sociodemographic character- 
istics close to the general population (see Additional file 1: 
Figure SI and Additional file 1: Tables S1-S6 for details). 

Data collection 

The main objective of this study was to identify individ- 
ual and collective determinants of HINlpdm infection; 
therefore we tried to collect comprehensive data about 
subjects and their environment, in addition to biological 
samples. Several household visits are carried on by 
nurses for this purpose (see Figure 3 for details). 

• Inclusion visits 

During the inclusion visit, nurses collected from 
all subjects detailed data regarding medical 
history, vaccination and preventive measures 
against influenza, smoking habits, socioeconomic 
status, risk perception and beliefs, frequency and 



characteristics of meetings with other people and 
housing (personal room, house or apartment). As 
the households' addresses were geocoded, we 
were able to get additional information from 
public databases regarding the immediate 
surrounding environment of households. An 
overview of data collected from questionnaires at 
entry in the cohort is shown in Figure 4. Blood 
samples were collected and centralized for 
serological analyses. For subjects over 10 years, a 
heparinated tube was also collected to study 
cellular immunity, as well as a blood sample 
dedicated to transcript analyses. 
Systematic yearly visits 

After the inclusion visit, systematic follow-up visits 
are carried on between influenza seasons. During a 
systematic visit, a nurse collects or updates 
individual and environmental data on 
questionnaires, completes previously missing data, 
and obtains blood samples from all members of the 
household. Two waves of systematic follow-up 
visits have already occurred (summer- fall 2010 and 
2011). A third wave is expected by the end of the 
second year of follow-up (summer 2012). 
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5102 households called to present 
the study 
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Pre-inclusion 




1280 households agreed to 


participate 





Inclusion visit 

607 households (1465 subjects) 
visited by a nurse 



Inclusion 

601 households (1450 subjects) 
included in the study (signed consent 
for all members) 



1 st systematic follow-up visit 

587 households (1414 subjects) 



2 nd systematic follow-up visit 

567 households (1360 subjects) 



Main reasons for not participating: 

• Refusal of blood sample collection in 

at least one household member 

Lack of time 



673 households not visited due to 
organizational constraints 



6 households with refusal of at least one 
member after new detailed information 
on the study 



36 subjects (14 full households + part of 
6 households) left the study before the 
1 st systematic follow-up visit: 12 subjects 
(5 households) lost to follow-up, 2 had 
died, 12 because of lacking time and 10 
for various personal reasons. 



54 subjects (20 households) left the 
study before the 2 nd systematic follow-up 
visit: 8 subjects (3 households) lost to 
follow-up, 1 had died, 31 because of 
lacking time and 14 for various personal 
reasons. 



Figure 2 Flow diagram until the 2 nd systematic follow-up visit. 



• Influenza-like illness (ILI) visits 

During the influenza season (as defined by the 
French surveillance network [21]), we use an 
active surveillance system order to detect ILIs: all 
households are called by an interactive voice 
response system (IVRS) weekly and are asked if 
any subject has symptoms of ILI (fever > 37.8°C 
associated with cough or sore throat, as defined 
by the CDC [22]). A free phone number is given 
to subjects to report symptoms spontaneously 
between two weekly calls. In case of reported ILI, 
symptoms are validated by the study team and 
then three "ILI visits" are organized: nurses visit 



the household within 48 h after the onset of 
symptoms, then 3-6 days and 8-12 days after the 
onset. 

• During these visits, a detailed questionnaire collects 
data about the circumstances of possible exposure to 
influenza viruses and the chronology of symptoms 
(if any) in all subjects. Nasal swabs are collected 
from all subjects. A stool sample and a throat swab 
are also collected from subjects with ILI, as well as a 
blood sample from those over 10 years of age. 
Moreover, a self-swab procedure is previously sent 
to the households in order to collect virological 
samples when a visit by a nurse within the first 48 h 
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December 2009 -Summer 2010 



Inclusion visit 



Winter 2009-2010 
Weekly surveillance ( I VRS) 



(questionnaires + blood samples) 



Summer-Fall 2010 
Follow-up visit 1 

(questionnaires + blood samples) 



Winter 2010-2011 
Weekly surveillance ( I VRS) 



Summer-Fall 2011 
Follow-up visit 2 

(questionnaires + blood samples) 



Winter 2011-2012 

Weekly surveillance (IVRS) 



Summer 2012 

Follow-up visit 3 

(questionnaires + blood samples) 



In case of influenza-like 
illness (I LI) in the household 

3 ILI visits in 2 weeks 

In all subjects: 

• Questionnaires 

• Nasal swab 

In subjects with ILI: 

• Throat swab 

• Stool sample 

• Blood sample (optional) 





Figure 3 Design of the study: systematic and additional visits of households by nurses. 



is not possible. Nasal swabs are used to identify 
various respiratory viruses by PCR and biochips 
allowing for multiple diagnosis tests. 

• This series of three visits can occur several times in 
the same household during an influenza season. 
There were 23 ILI alerts during the 2009-2010 
season (as households were still being included) and 
143 during the 2010-2011 season, all of which 
triggered up to three ILI visits. 

• Vaccination visits 

In order to update serological information, a 
blood sample was collected from subjects who 
had an influenza vaccination, between 2 and 
4 weeks following this vaccination. There was one 
vaccination visit following the inclusion visits; 29 
vaccination visits were conducted following the 
first wave of follow-up visits and 69 following the 
second wave. 

Timeline 

The cohort was initially designed to include households be- 
fore the 2009 pandemic season and to follow subjects dur- 
ing the two subsequent influenza seasons. We obtained 
funding in June 2009, the cohort protocol and question- 
naires were finalized in July 2009, and the protocol obtained 
ethical approval on September 8, 2009. Households were 



pre-included between September 25 and December 17, 
2009 and inclusion visits began on December 4, 2009 - as 
the final administrative authorizations were obtained. The 
inclusion period was extended until July 31, 2010, in order 
to get a relevant sample size for the planned analyses, some 
of which were postponed until the following season. A total 
of 575 households (96%) were included after the first pan- 
demic season (September 7 to December 27, 2009 [23]). 

Ethical considerations 

The protocol of the CoPanFlu-France study was 
approved by the research ethics committee "Comite de 
Protection des Personnes Ile-de-France 1" on September 
8, 2009. Information was previously given by investiga- 
tors to participants indirectly through written descrip- 
tions of the study and training of the nurses, and 
directly by e-mail and telephone for any question. Writ- 
ten informed consent was obtained for all subjects. 

Discussion 

Expected results 

Many analyses have been recently completed or are cur- 
rently being carried out. Based on inclusion data, we 
used a data-driven approach to identify factors asso- 
ciated with a high anti-HlNlpdm serological titer. We 
are conducting several analyses to identify risk factors 
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Sociodemographic characteristics and medical history 



Characteristics of the housing 



Sociodemographic characteristics 

• Age class (a) : <15 years, 15-50 years or 
>50 years 

• Sex = Male 

• Number of subjects in the household 

• Number of children in the household 

• Socio-professional category of the head 
of the family (a) : 

- Farmer, primary sector 

- Artisan, shopkeeper, chief executive 
officer 

- Executive, intellectual profession 

- Middle class 

- Employee 

- Working class 

- Retired 

- Other 

• Professional occupation 

- Work in confined space 

- Work outdoors 



- Work with children 

- Work with sick persons 

Medical history and habits 

• History of chronic disease 
Immune disorder 
Diabetes mellitus 
Chronic obstructive pulmonary 
disease 

History of asthma 
Current asthma 
High blood pressure 
HIV infection 
Hepatic disease 
Renal disease 
Respiratory disease 
Systemic disease 
Multiple sclerosis 
Other neurological or muscular 
disease 



- Heart failure 

- Cardiac arrhythmia 

- Coronary heart disease 

- History of cancer/hematological 
malignancy 

- Beta-thalassemia minor 

• Pregnancy 

• Current or former smoker 

• Current smoker 

• History of ILI for seasons 2006-2007 to 
2009-2010 

Medical history and habits in other 
subjects of the household 

• Current or former smokers 

• Current smokers 

• Histories of ILI for seasons 2006-2007 to 
2009-2010 



Geographic area 



Surrounding activity 

• Farm: 

- Pig farm 

- Poultry farm 

• Factory 

• Fields 

Demography 

• Size of urban area 
(inhabitants) 

• Proportion of Inhabitants by 
age class 

• Proportion of socio- 
professional groups 

_ • Rate of unemployment 



• Proportion of people using 
public transport to go to work 

• Proportion of households 
composed of: 

- Families with children 

- Families with > 2 children 

- Persons living alone 

• Types of dwellings 

• Proportion of dwellings 
owned by inhabitants 

• Proportion of social housing 

• Surface of dwellings 

• Proportion of migrants 

• Mean annual income 



Preventive measures 



Vaccination 

• Pandemic vaccination 

• Time since pandemic 
vaccination 

• Seasonal vaccination for 
seasons 2006-2007 to 2008- 
2009 

• Anti-pneumococcal 
vaccination 

Hygiene measures 

• Daily frequency of hands 
washing (with soap or hand 
sanitizer) 

• Cover coughs/sneezes 

• washes hands after 
coughing/sneezing 

• Uses facial tissues (vs. 
handkerchiefs) 



Vaccination in other subjects 
of the household 

• Pandemic vaccination 

• Seasonal vaccination for 
seasons 2006-2007 to 2008- 
2009 

Hygiene measures in other 
subjects of the 
household 

• Daily frequency of hands 
washing (with soap or hand 
sanitizer) 

• Always cover coughs/sneezes 

• Always or often wash hands 
after coughing/sneezing 




1 General characteristics 

- Type of dwelling 
(house/apartment) 

- Dwelling's area 

- Dwelling area per inhabitant 

- Number of rooms 

- Number of bathrooms 

- Number of toilets 

- Thermoregulator 

- Number of trash cans 

- Covers on trash cans 

- Frequency of waste 
collections 

- Garden's area 

• Presence of animals: 

- Dog(s) 

- Cat(s) 

- Rodent(s) 

- Bird(s) Pandemic vaccination 



' Kitchen: 

- Type of heating 

- Type of flooring 

- Area 

- Dishwasher 

- Hood 

' Living room: 

- Type of heating 

- Type of flooring 

- Area 

- Temperature 

- Air humidifier 

- Hood 

• Bedroom: 

- Type of heating 

- Type of flooring 

- Area 

- Windows orientation 

- Temperature 

- Air humidifier 

- Insulated glazing 



Attitudes, beliefs and risk perception 



Duration of 
meetings 

By location: 

• Professional occupation 

• At home 

• At school 

• At work 

• In transports 



By age of contacts: 

• 0-10 years 

• 10-20 years 

• 20-50 years 

• 50-70 years 

• >70 years 



Attitudes 

• In case of epidemic, I would intend to: 

- Wear masks in public 

- Take antivirals such as Tamiflu® 

- Get vaccinated against seasonal flu 

- Often washing your hands 

- Avoid public transports 

- Avoid public gatherings 

- Keep children away from school 

- Stay away from work 

I have already taken preventive measures 
against influenza 

Beliefs 

• Some people were infected by influenza A 
in France 

• Some people died from influenza A in 
France 

• Influenza A remains a mystery to me 

• I have a very clear idea about this disease 

• Mechanisms of the disease are not easily 
understood 

• Influenza A can be transmitted by: 

- Spit 

- Nasal mucus 

- Cough or sneeze 

- Contact of hands or cheeks 

- Inert objects 

- Pork consumption 

• Efficient measures to avoid being infected: 

- Wearing masks in public 

- Antivirals such as Tamiflu® 

- Vaccination against seasonal flu 
Hands washing 
Avoiding public transports 



- Avoiding public gatherings 

- Not sending children to school 

- Not going to work 

- Limiting traffic in infected areas 

- Forbidding street protests 

- Closing schools and preschools 

- Putting sick people in quarantine 

• All infected people have symptoms 

• Influenza A is a severe disease 

• Influenza A has a severe impact on bodily 
functions 

• Consequences of influenza A are totally 
unpredictable 

• Influenza A is most often fatal 

• I feel well informed regarding influenza 

• I am confident in authorities to mitigate an 
epidemic 

• I am confident in authorities to inform me 
about influenza A 

• Authorities hide important information 
about influenza A 

Risk perception 

• There's a high probability (>50%) of being 
infected: 

- If no preventive measure is taken 

- If preventive measures are taken 

• Some preventive measures can reduce the 
infection risk 

• Prevention against flu highly depends on 
behaviors 

• There's not much we can do to protect 
ourselves against influenza A 

• We can reduce the infection risk by taking 
personal measures 



Figure 4 Main data collected on questionnaires at entry in the cohort, in addition to blood samples. 



associated with influenza infections (based on serological 
data for the first pandemic season, then on both sero- 
logical and virological data for the following seasons). 
Nasal swabs are being analyzed to identify various re- 
spiratory viruses and the characteristics of infected sub- 
jects. Blood samples collected during ILI visits are used 
to study innate immunity against influenza and the 
related transcriptome. Determinants of vaccination 
against influenza have also been identified, and other 
studies are being conducted in the field of social science 
and risk perception. Several other analyses are expected 
soon from different collaboration partners in various 
biomedical fields. 



Strengths of the study 

The main strength of this cohort is the large amount of 
available and expected data and the different biological 
samples to be collected, which will permit to carry on 
many studies in various biomedical fields. To our know- 
ledge, this project is the first attempt to study so thor- 
oughly the determinants of infections by respiratory 
viruses in a large sample of households randomly 
selected in the community. This approach is likely to 
provide new insights from the interaction of sparse data 
usually studied separately, especially with the help of 
data-driven methods such as those already under devel- 
opment in the field of non-communicable diseases 
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[15,16]. Additional data is being collected and analyzed 
in this ongoing cohort, whose longitudinal analysis will 
permit integrative analyses of complex phenomena such 
as individual, collective and environmental risk factors 
of infection, routes of transmission, or determinants of 
the immune response to infection or vaccination. 

Limitations 

We designed the CoPanFlu-France study in order to en- 
able inference to the French general population, yet we 
cannot exclude a selection bias induced by the propor- 
tion of contacted households who refused to participate. 
However, a comparison between CoPanFlu subjects and 
population census data [20] suggests that this bias was 
controlled (see supplementary material part 2). 

We wish we were able to set up this project a few 
months earlier, in order to include households before 
and to follow-up subjects during the 2009 pandemic sea- 
son. Due to organizational impairments, the inclusion 
process was delayed and data regarding ILIs were col- 
lected retrospectively, sometimes up to 6 months after 
the epidemic. Thus, this timeline of inclusion may have 
induced recall or reporting biases for the 2009 season, 
and we were not able to collect enough pre-pandemic 
blood samples and nasal swabs during this first 
HINlpdm season to investigate laboratory-confirmed 
infections. Another consequence of this delayed inclu- 
sion process is that we decided to stop inclusions as only 
601 out of the 1000 expected households were included. 
This limit is the main reason why we decided to post- 
pone the end of the study until 2012 instead of 2011 as 
initially expected, in order to collect data during an add- 
itional influenza season. 

Additional file 



Additional file 1: Design and methods of the CoPanFlu-France 
cohort: representativeness of the population sample. 
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